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Abstract — In this paper we use an afflne connection for- 
mulation to study an optimal control problem for a class of 
nonholonomic, under-actuated mechanical systems. In partic- 
ular, we aim at minimizing the norm-squared of the control 
input to move the system from an initial to a terminal state. 
We consider systems evolving on general manifolds. The class 
of nonholonomic systems we study in this paper includes, 
in particular, wheeled-type vehicles, which are important for 
many robotic locomotion systems. The two special aspects of 
this optimal control problem are the nonholonomic constraints 
and under-actuation. Nonholonomic constraints restrict the 
evolution of the system to a distribution on the manifold. The 
nonholonomic connection is used to express the constrained 
equations of motion. Furthermore, it is used to take variations 
of the cost functional. Many robotic systems are under- 
actuated since control inputs are usually applied through the 
robot's internal configuration space only. While we do not 
consider symmetries with respect to group actions in this 
paper, the fact that the system is under-actuated is taken 
into account in our problem formulation. This allows one to 
compute reaction forces due to any inputs applied in directions 
orthogonal to the constraint distribution. We illustrate our 
ideas by considering a simple example on a three-dimensional 
manifold. 

I. Introduction 

In this paper we use the theory of affine connections 
to study force minimizing optimal control problems for 
a large class of nonholonomic under-actuated mechanical 
systems. Mechanical systems considered in this paper may 
be nonlinear and evolving on algebraic (for holonomically 
constrained systems) and/or abstract manifolds such Lie 
groups (in particular, the group of rigid body motions in 
three dimensional space, SE(3), and its subgroups). The 
class of nonholonomic systems we study in this paper 
includes, in particular, any wheeled-type vehicle, such as 
robots on wheels and or tracks. The fact that most of 
these robotic systems apply torques and forces internal 
to the system, which makes these system move in an 
undulatory fashion (see [1] and references therein for more 
on undulatory locomotion), without the application of any 
external forces, makes the system under-actuated. In fact, 
control inputs that are applied through the shape space in 
the absence of any control authority through the group space 
(that is, the fiber.) Hence, including under-actuated systems 
in our study is crucial in covering a wide range of robotic 
applications. 
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Nonholonomic mechanical control systems have a long 
and complex history which is described in, for example, 
[2], [3] (in particular. Chapter 5) and [4]. Of much inter- 
est in the present work are the recent developments that 
utilize a geometric approach [5], [3] and, in particular, the 
theory of affine connections [6], [4]. These methods offer 
a coordinate-free differential approach to mechanics and 
control that avoids many of the issues that arise in classical 
mechanics such singularity and change of coordinates, 
complexity of notation and the lack of a geometric picture. 
For more on differential-geometric mechanics and its use 
in the context of dynamics and control, we refer the reader 
to [7], [3], [4]. For the treatment of under-actuated systems 
using affine connections, we refer the reader to [8]. 

Aside from [9], [10], previous results usually treat kine- 
matic systems that usually aim at minimizing energy. In 
this paper the cost function is the square of the norm of the 
total applied control. We treat second order (i.e., dynamic) 
nonholonomic systems and allow for under-actuation. As 
will be seen in this paper, the set of necessary optimal- 
ity conditions are coordinate-free and generic for a large 
class of nonholonomic mechanical systems. Given problem- 
specific data, one can specialize the result to the specific 
problem at hand. This process can be automated using 
symbolic manipulation packages such as Mathematica® and 
toolboxes such as those introduced in [11]'. 

While most of the systems appearing in robotics nat- 
urally posses symmetries with respect to a group action, 
which leads to the reduced equations of motion for the 
system, in this work we provide a framework for treating 
nonholonomic systems in the context of optimal control 
using the theory of affine connections, regardless of the 
presence of any symmetries. In the case where symmetries 
do exist, one can usually do more by utilizing the structure 
of the equations of motion as done in [9], [10]. In [9], 
[10], however, the authors use the momentum equation 
form of the reduced equations of motion [5]. The problem 
of optimally controlling systems with symmetry will be 
treated in a future paper In particular, we are interested 
in understanding how results based on an affine connection 
approach and Lagrange's multiplier method relate to results 
based on the momentum equation form that appear in [9], 
[10]. For more on systems with symmetry we refer the 
reader to, for example, [1], [5], [7], [12], [13] and references 
therein. 

' These packages are available online for which a reference is provided 
in [11]. 



The paper is arranged as follows. In Section n, we briefly 
describe how nonholonomic mechanical systems are treated 
using the theory of affine connections. We also state the 
relationship of this approach to the Lagrange-d'Alembert 
equations of motion for nonholonomic systems. In Section 
III, we introduce the optimal control problem and derive 
the necessary optimaUty conditions using the theory of 
affine cormections. In Section IV, we use the vertical coin 
(equivalently, the inline or ice skate) as a simple example 
to illustrate how to perform the computations. Finally, in 
Section V, we summarize our results and describe areas of 
current and future research. 

II. Review of Affine Differential Geometry and 
Nonholonomic Systems 

A. Riemannian Manifolds and Affine Connections 

In this section we give brief definitions of the various 
objects from affine connection theory that are essential 
to this paper. For more complete studies, we refer the 
reader to the mathematically-oriented text [14] or the more 
mechanically-oriented text [4]. 

Let Q be a smooth (C°°) Riemannian manifold with the 
Riemannian metric defined by (?q : TqQ x TqQ ^ R at 
some point q e Q, where TQ = UqTqQ is the tangent 
bundle of all tangent space TqQ at all points q G Q. 
Thus the length of a tangent vector Vq e TqQ is given 

by Vflq(Vq,Vq). 

A Riemannian connection on Q, denoted V, is a mapping 
that assigns to any two smooth vector fields X and Y on 
Q a new vector field, VxY. For the properties of V, we 
refer the reader to [14], [15], [3]. The operator Vx, which 
assigns to every vector field Y the vector field VxY, is 
called the covariant derivative of Y with respect to X. 

The Lie bracket of the vector fields X and Y will be 
denoted by [X, Y] and is defined by the identity: [X, Y]/ = 
X(Y/) - Y(X/), for all C°° functions / : Q ^ M. Given 
vector fields X, Y and Z on Q, define the vector field 
n (X, Y) Z by the identity 

7e(X,Y)Z = VxVyZ- VyVxZ-V[x,y]Z. (1) 

TZ is trilinear in X, Y and Z and is a tensor of type (1, 3), 
which is called the curvature tensor of Q. 

Finally, we will employ the musical isomorphism (j^ : 
T*Q ^ TQ (called the "sharp") and its inverse bg : TQ 
T*Q (the "flat") associated with the metric g and defined 
by the relation T*Q 3 Y^(X) = g(Y, X), for all X G TQ. 
The sharp is induced from the definition of the flat. 

B. Nonholonomic Systems and the Constrained Affine Con- 
nection 

In this section we introduce the affine connection view- 
point of mechanical control systems. The discussion pre- 
sented here is based on the material found in [6], [4], [16]. 
Let Q be a C°° n-dimensional manifold with the tangent and 
cotangent bundles denoted by TQ and T*Q, respectively. 
An under-actuated constrained simple mechanical control 



system is given by the quadruple (Q, g, V), where gfq : 
TqQ X TqQ ^ R is the kinetic energy (Riemannian) 
metric on Q at q G Q. The collection of covectors T = 
{fi,F2,...,Fp} g T*Q, p < n, is a set of linearly 
independent 1 -forms on Q that represent the directions of 
the forces and torques acting on the system given by 
p 

T = ;^TiF*GrQ. (2) 

i=l 

Hence, the system is underactuated with underactuation 

degree n — p. The subspace V is an {n — m)-dimensional 
nonholonomic distribution on Q, where the m constraints 
are given by 

<(vq) = 0, z = l,...,m, (3) 

where uj* G T*Q are one-forms on Q. In this paper 
we only consider systems that evolve in a potential-free 
environment. The presence of a potential does not intro- 
duce additional theoretical challenges to our treatment and, 
hence, we omit it here for the sake of simplicity. In a future 
archival version of this work, we wiU include potentials to 
study a wider range of nonholonomic systems. 

For a simple mechanical control system without a poten- 
tial the Lagrangian L : TqQ ^ IR is given by 

i^(q,v) = ^ff(v,v). (4) 

The Lagrange d' Alembert principle then gives the following 
equations of motion 

d dL dL , 1 , 

d^a^-a^ = E^M+E-^F^ (5) 
1=1 1=1 

where are Lagrange multipliers such that A = 
J2'j=i represents reaction forces. The system of equa- 
tions (5) is equivalently written using the affine connection 
as 

Vv(t)v(t) = A(t)»«+u(i) 

m = v(i) (6) 

where V is the Levi-Civita connection compatible with 
the metric g, X{t) is a section of V-^ (the ^-orthogonal 
complement of V) and 

u{t) = T«»(t) = x^nw {F^y- = j2nm,, (7) 

1=1 i=l 

where Yj = (F')'" are the corresponding input vector 
fields. 

If we define P : TQ ^ D C TQ and Q : TQ ^ D-^ C 
TQ to be the complementary gi-orthogonal projectors, then 
the equations (6) are equivalently written as 

Vv(t)V(t) = P(u(i)) 

q(i) = v(t), (8) 



where now we only require that the initial velocity be 
v(0) G to ensure that the flow remains on the constrained 
disctribution. The connection V is called the nonholonomic 
affine connection and is given by 

VxY = VxY + (VxQ)(Y) 

= (VxY) + Vx (Q(Y)) , (9) 

for all X, Y G TQ. Note that VxY G V for all Y G D and 
X G TQ [4], [6]. The constrained connection also appears 
in [17]. We now give further properties of the nonholonomic 
connection V, in particular, how they operate on functions 
and one-forms. 

Lemma ILL Vx./ = Vx./ for all / G C°°(Q). 

Proof. This is obvious since for any affine connection V 
we have Vx/ = 'Cx/ = the Lie derivative of / 

with respect to the vector field X. This is true since the Lie 
derivative C is independent of the choice of V. ■ 

Lemma II.2. For all A G T*Q we have 

VxA- VxA-(VxQ)*(A), 

for all X G TQ, where * denotes the adjoint of a map. Note 
here that Q (and P) is a (1, 1) tensor and so is VxQ and 
its adjoint VxQ- 

Proof. Given our knowledge of how V acts on vector fields 
(equation (9)) and Lemma II. 1, we have 

VxA(Z) = Vx(A(Z))-A(VxZ) 

= Vx(A(Z))- A(VxZ + (VxQ)(Z)) 
= (VxA)(Z)+A(VxZ)-A(VxZ) 

-A((VxQ)(Z)) 
= (VxA)(Z)-((VxQ)*(A))(Z) 

for all vector fields X, Z G TQ. For the first equality we 
used Vx (A(Z)) = (VxA) (Z) + A ( VxZ) . ■ 
Finally, recall the definition of the curvature tensor TZ, 
which arises naturally in higher order optimal control 
problems, associated with an affine connection V given 
by equation (1). Associated with the nonholonomic affine 
connection is the nonholonomic curvature tensor, denoted 
1Z, that also satisfies equation (1) but with V replacing 
V everywhere. We have the foUowing observation for the 
nonholonomic curvature tensor TZ. 

Lemma n.3. The nonholonomic curvature tensor satisfies 

:^(x,Y)z = 7e(x,Y)z 

+ ((VxVy - VyVx - V[x,Y]) Q) (Z) 

+ (VxQ) [(VyQ) (Z)] - (VyQ) [(VxQ) (Z)] 

for ah X, Y, Z G TQ. 

Proof. The proof comes from the definition in equation 
(1) of the nonholonomic curvature tensor in terms of the 
nonholonomic connection. One then uses the definition of 
the nonholonomic connection in equation (9) to substitute 



V in the equation for TZ in (1). The rest of the proof is 
straightforward algebraic operations. ■ 

III. Optimal Control of a Nonholonomic System 

In this section we introduce the optimal control problem 
and derive the necessary optimahty conditions. In this 
paper, we use Lagrange's multiplier method for constrained 
problems in the calculus of variations. We only investigate 
normal extremals, which is a reasonable assumption for sim- 
ple mechanical control systems that occur in engineering. 
We also note that, while the system is controlled through the 
shape space only and is, hence, inherently, under-actuated, 
a basic assumption is that the system is controllable. 

Problem m.l. Minimize 

J{t)= £ ^g''{u,u)dt (10) 

subject to the dynamics given in equation (8) and some 
initial and terminal conditions q(0) and q(T), respectively. 
The (0, 2) tensor is the standard identity metric on R". 
The state q(T) is assumed to be reachable by the system 
from q(0). 

Note that we want to minimize the norm of u as opposed 
to the norm of its projection P{u). In other words, gener- 
ally, the above formulation does not attempt to minimize the 
constrained apphed torques. Intuitively, one forecasts that 
no control forces and torques should be applied in directions 
that violate the constraints since these will be squandered 
by creating only more reaction forces (that maintain the 
constraints) with no net useful motion, or by violating 
the constraints altogether. For example, for the rolling 
vertical coin [5], if excessive torque is applied in the rolling 
direction, the rolling constraint may be violated. Moreover, 
the application of any side forces will not contribute to 
the net motion of the system due to the strict no-side-shp 
(the "knife edge") constraint. As will be shown below, it 
turns out that the control will be constrained to lie in V as 
expected, hence, not allowing for violation of the constraints 
or the application of unnecessary control. 

In undulatory locomotion, which is of main interest in 
this work, by definition, we usually require that the control 
be applied through the shape (internal configuration) space 
only. Hence, we need to impose the constraint that the 
generahzed control vector field in the group directions be 
zero. We do this as follows. Let F^, i = 1, . . . ,n — 
form an orthogonal set of co- vector fields complement to the 
CO- vector fields Fj, i = 1,. . . ,p. Then, define a Lagrange 
multiplier one-form ^ = ^jFj such that 

^(u)=0. (11) 

We only treat the general case when Q is an arbitrary 
manifold and speciahze the result to trivial fiber bundles 
in future publications. By trivial fiber bundles we mean 
manifolds of the form Q = G x S, where G is a Lie group 
that represents the fiber, or overall configuration of the 



system, and S is the shape space, or internal configuration, 
of the system. We begin by forming the appended cost 
functional 

i-T 



J 



Jo 2 



g^{u, u)+£(u)+A*(q-v) 



-77 (VvV - P (u)) dt, 



(12) 



where /it, t; € 25*0 are Lagrange multipliers. 
Remark. Let uj^, j = m + 1, . . . , n, be the set of one-forms 
that span V* . This set of one forms along with the one- 
forms ui^, i ~ 1, . . . ,m, form a basis for T*Q. In the above, 
we view e T>*, the co-tangent constraint distribution 
spanned by the one forms w-' for , as a one-form on Q 
such that /Li : I? — > M. On the other hand, it is important 
to note that the derivatives of the velocity vector field is 
general not going to be in the constraint distribution V. 
Recall that if v e X>, then Vxv e P for all X e TQ. 
Hence, the argument of rj is always in V and we view 
2?* 9 ry : P ^ ]R. Moreover, observe that P*u) ~ u) for all 
u> e V*, where V* : V* C TQ -> T*Q is the adjoint of 
the map V. • 
Taking variations of equation (12) we obtain 



9 



VwU,u + ^« 



dt 



fi \ —w - Vwv 

+ V (VwVvV - (VwV) (u) - V (Vw (u))) dt, 

where Vw is the covariant derivative with respect to the 
variation vector field W e TQ given by 

dq{t,e) 



W 



de 



(13) 



with q(i, e) being the one-parameter variation of the optimal 
curve q_{t). In the above expression, we used the fact 
that VxA (Z) = (VyiX^ (Z) -|- A (VxZ^, for any affine 

connection V, vector fields X and Z and any co-vector 
field A (see page 78 in [18]). When A is a Lagrange 
multiplier, say fi or rj, the terms (VwM) (q — v) and 
(Vw^) (VvV — V (u)) give us the equations of motion (8) 
later when we set 6 J equal to zero. Hence, as usually done 
in optimal control theory, these two terms can be omitted 
without affecting the rest of the derivation. 

We now study the term VwVvV. Using the definition 
of the constrained cormection in equation (9) we get 

VwVvV = Vw [V (Vvv) + Vv (Q (v))] . 

However, since the dynamics given by equation (8) guaran- 
tee that V e 7? (assuming v(0) € T>), we have Q(v) = 0. 
Therefore, we obtain 

VwVvV = (Vw^P) (Vvv) + V (VwVvv) . 

Recall that the curvature tensor, TZ, associated with the 
unconstrained connection V, satisfies the identity given by 
equation (1). Since W is arbitrary, and hence independent 
of q, then [W,v] = such that [19] 

VwVvV = Vv Vwv + n (W, v) V. 



Finally, we conclude that 

VwVvV = (Vw?')(Vvv) 

+V (VvVwv + 7^(W,v) v). 
Using this identity, we obtain 

5 J = £ <7^ (Vwu, u + ) + M (^^ W - Vwv) 

+ r]{r (VvVwv + n (W, v) v) 

+ {VwV) (VvV -u)-V (Vw (u)) )dt. 

For the two first terms in the argument of /i and r), we 

integrate by parts and use the fact that W(0) = W(T) = 
Vwv(O) = Vwv(r) = to obtain 

j\(^^W^dt = - j\v^n){W)dt (14) 

/ »7(^(VvVwv))di= - /" Vv(P*(»7))(Vwv)dt 
Jo Jo 

= - [ Vvr/ (Vwv) dt, 

Jo 

where we made use of the fact that V*r) = rj and that 

r, e V *. 

Next, we refer the reader to the properties of a curvature 
tensor TZ defined in terms of the connection V and a 

metric g found in [20], Proposition 2.5 on page 91. From 
these properties, one can show that the curvature satisfies 
g {n (W, v) V, x) = 5 {n (X, v) V, w) , where H is the 

curvature tensor based on a connection V that is compatible 
with g (this is shown in [15], for example). Going through 
the proof of Proposition 2.5 in [20], one can see that all 
the derivation can be generalized to an arbitrary metric g 
and not only to g. Hence, we have g {it (W, v) v, X^ = 

g {iZ (X, v) V, , for any positive definite metric g on 
Q. In the context of our problem, this gives 

77 {V (7^ (W, v) v)) = ff^ ((P*77)««- , n (W, v) v) 

= ff^(w,7^(r7««^,v)v). (15) 

Moreover, note that 

77 (P (Vwu)) = (P*77)(Vwu) 

= 5''(r7"''",Vwu). (16) 

Finally, recall that P is a (1, 1) tensor. Hence Vw'P is also 
a (1,1) tensor (for this check out any book on differential 
geometry, though it is exphcitly stated in [6]). So ( Vw^)* : 
T*Q TQ is the dual of Vw'P- With this observation 
we have 

77(Vw7'(VvV-u)) = ((VwT')**?) (a»«") (17) 

= 5"^(((VwPrT7)"''^AM) 

where we recall from equation (6) that A is the net reaction 
generaUzed force co-vector field. 



Using equations (14)-(17), we find that 



(18) reduce to 



.g^ (^Vwu, u + e^'" - r]h^ 

- Vv/x (W) + n (j7«s^ , v) V ^'^ (W) 
+ ((Vw7')*r,) (aM) 



Two important points need to be emphasized. First, the term 
Vw^ involves variations in the configuration variables 
only since V is an operator that depends only on the 
configuration variables. That is, while V acts on the velocity 
vector field Vq at the point q, T' as a projector depends 
solely on the point q. Thus, this term depends on W only. 

The second observation we wish to make is that Vwu or 
Vwv each can be separated into two terms. For example, 
for Vwu, the first variation term involves only variations 
in the control components r^, i = l,...,m, while the 
second will involve configuration variations W only coming 
through variations of the basis vectors, denoted by Y^. This 
should be realized in order to completely (and rigorously) 
separate variations in configuration, velocity and control 
variables. For more on this, see for example the discussion 
in Section II in [21] and the definitions of the operators 
B and 5 therein. It turns out that if we simply ignore this 
separation step and treat VwU and Vwv as terms that 
involve variations in control and velocity variables only, 
respectively, and no variations in the configuration variables, 
then we end up with exactly the same result. We emphasize 
that the separation step is the correct rigorous mathematical 
approach, while ignoring it is not rigorous albeit reduces the 
number of steps to obtain the same correct result. 

From the above discussion, we realize that since W, 
VwU and Vwv are independent variations and since for a 
normal extremal we must have SJ' ~ 0, we conclude that 
along the optimal trajectory we must have 



u + = 




VvM = 




H 


VvJ? = 







(18) 



where b^E and ttgs are the musical isomorphisms with 
respect to the standard metric . 

Remark. (The Unconstrained, Fully Actuated Problem) 

In the unconstrained case V — I : TQ — > TQ is simply the 
identity map on TQ for all q G Q. Similarly for V*, V* — 
J* : T*Q ^ T*Q is the identity map in T*Q for all q e Q. 
Q will have a null space 7\A(Qq) — TqQ for all q e Q. 
In the fully actuated case we have ^ = 0. Therefore, in the 
unconstrained, fully-actuated case the necessary conditions 



u = 




Vv/i = 




H 


VvJ? = 







These are precisely the result obtained in, say, [21]. • 

IV. Example: The Vertical Coin 

A. Constrained Equations of Motion 

We now give an example to illustrate the above approach 
and compare the result to traditional methods. In this section 
we study optimal control of the vertical coin (i.e., it can 
not fall sideways). The system is shown in Figure 1. The 
mass of the coin is m and its mass moment of inertia about 
the vertical axis is J. The position of the point of contact 
between the coin and the plane is denoted by {qi, 92) while 
its heading direction is denoted by 53 as shown in the figure. 
The configuration q is then given by q = (917 93) and 
the configuration space is simply SE(2). The control input 
is denoted by ui for the force applied to the center of mass 
of the coin and U2 for the torque applied about the vertical 
axis. 




Fig. I. The vertical coin. 

The constraint we have is that the coin can not slip 
sideways (i.e., it satisfies a "knife-edge" constraint). This 
constraint is symbolically expressed in differential form by 
equation (3) with m = 1 and 



Wq = sinqsdqi - cosq3dq2. 



(19) 



Hence, the constraint distribution is given by the span of 
the vector fields 

Xi(q) - 7^ 
dq3 

d d 

X2(q) = cos hsmgs — . (20) 

dqi dq2 



The lagrangian for the vertical coin is given by 

i(q, q) = 5(q, q) = {qf + g|) + ^Jqj 
and, hence, the components of the metric g are given by 

911 = 922 = m, 933 = J (21) 

where all other components are zero. 

We denote the unconstrained connection by V. Since 
the metric is coordinate independent, the unconstrained 
Christoffel symbols Tjf. are all zero. The curvature on 
SE(2) is identically zero. (For why curvature is zero on 
SE(2), see [22].) For the constrained system, one can 
check that the Christoffel symbols corresponding to the 
constrained connection V are given by 

f la = -f 23 = sin(2g3), f I3 - f ^3 = - cos(2g3), (22) 

where all other Christoffel symbols are zero. 
The external generalized force is given by 

T = MlF^+M2F^ (23) 

where one can check that the inputs have the directions 

= cosfjfsdfjfi + cosg3dg2, = dq3. (24) 



The vector fields Y, 
Yi = 
Y2 = 



= (F'j " are then given by 
cos 53 d ^ sin 93 d 



m dq2 



m dqi 
1 d 

J dqs 

such that the input is given by 

Ui cos 53 d ui sin qs d U2 d 



u = 



m 



dq-i 



+ 



m 



dq2 J 9^3 ■ 



(25) 



The projection map V : TQ V is a (1,1) tensor 
whose elements are computed as follows. Let Z G TQ be an 
arbitrary vector field. Then V (Z) = 9 (Z, X,) X^, 

where C*^ is the inverse of the matrix Cy = (7(Xj,Xj), 
i, j = 1,2 [16]. If we have V = V^dqt (g) one can check 
that the components of V are given by 

72 



-pl = cos^ (73, ='Pi = cos q3 sin 53, 



1, 



(26) 



where all other components are zero. Hence, we have 



ui cos 03 d ui sin 03 5 U2 d 

H H -TT^ = u. 



m dqi m dq2 J dqs 

One could have anticipated this result since u is applied in 
directions lying inside the constraint distribution V. This 
gives the right hand side of the nonholonomic equation 
motion given in equation (8). 

Remark. If the force ui apphed to the center of mass was 
restricted to be along the a;-direction, ^(u) will not be equal 
to u. In this case, u will be projected down to the constraint 
distribution by creating a reaction force perpendicular to the 



direction of motion to prevent any motion that violates the 
knife-edge constraint. • 
Finally recall that VvV is given in coordinates by 

3 _ g 



= [qi + 93 {qi sin(2g3) - ^2 cos(2q'3))] 



dqi 



+ [92 + qs {-qi cos(2g3) - q2 sin(2g3))] 



+93 



d 

993' 



dq2 



where we have used the constrained connection coefficients 
given in equation (22). This gives the left hand side of 
equation (8). Hence, the equations of motion are given by 

cos q3Ui 



qi = {-qi sin(2g'3) + ^2 003(253)) qs + 



m 



h = {qi cos(2g3) + 92 sin(2g3)) 93 + ^'"^^^^ (27) 



93 



U2 

J' 

which simplify to 

qi 



m 



qs = 



-92^3 + 

9l93 + 
M2 
J 



cos 93^1 

m 

sin q3Ui 
m 



(28) 



after using the constraints (19). 

B. Optimality Conditions 

We now apply equations (18) for the vertical coin. First, 
we need to find a basis for V* , which we take to be 

Wq = cos(3'3dgi + sing3dQ'2 

cjI = dq3. (29) 

Once can check that these are orthogonal to Wq. Hence, the 
lagrange multiphers are given by 

= fi^ cos qsdqi + jjt^ sin g'3dg2 + M^dgs (30) 



and 



T7 = ri^ojl+rfu}^ 



q ' ■/ -q 

= rf cos g'sdg'i + rf sin g3dg2 + 'rfdq3. (31) 

The under-actuated direction is given by F^ = 
— singsdq'i + cos(73d<72 such that $, = ^iF^ and 
^ttgE = -sing3^i d ^ cos 93^1 d 
m dqi m dq2 
The first of equations (18) then gives 

,2 



ui = mr] 
U2 = Jrf 
6 = 0. 



(32) 



In fact, ^1 is nothing but the generalized reaction force 
created by the knife edge constraint in reaction to any forces 
applied normal to the constraint. 

We now obtain the differential equation for /n. The 
curvature tensor on Q is identically zero in this case. 
Moreover, since there are no external forces and torques 
other than the control inputs, which are applied in T>, then 
the constraint reaction forces A are identically zero in our 
example. Generally the reaction forces A won;t be zero. 
Finally, V has identically zero Christofel symbols. Thus, 
Vv becomes a simple time derivative. Hence, the equation 
for /Lt simply gives 



jJL COS 93 



-.3 



M = 



-yU^ cos 93^3 
0. 



Multiplying the first equation by cos 93 and the second by 
sin 93 and summing we obtain 




0. 



(33) 



This gives the /i. differential equation. 

Finally, we compute Vv^? — —A*. Hence, r] satisfies the 
following differential equation equations 

T}^ cos 93 = -I? cos 93 + rfqs sin 93 



sin 93 



-yu^ sin-r/^93C0S93 



Multiplying the first equation by cos 93 and the second by 
sin 93 and adding both expressions we finally obtain 



(34) 



We see that the necessary conditions for this example 
are particularly simple. This is due to the fact that the 
vertical coin has a manifold SE(2) that is isomorphic to 
X which is a differentially fiat space. This flatness 
and the positive definiteness and convexity of the cost 
functional (10) render the problem convex, with simple 
linear necessary conditions that can be solved analytically 
for the global optimal solution. The necessary conditions 
are solved to get 



ri\t) 



Mo 
Mo 

-Mo* + vl 



(35) 



where /j,q , fif^ , rjf^ , r]f^ are initial conditions determined from 
the boundary conditions on the state of the system q and v. 
These expressions may then be used to compute the optimal 
control law from equation (32). 

In this paper, we give a simple example for the sake 
of transparency of the approach. A longer version of this 



work will include more interesting examples in the context 
of optimal robotic locomotion. 



C. Verification of Results Using Classical Methods 

In this section we verify the necessary conditions ob- 
tained in the last section, which were the coordinate expres- 
sions of equation (18) for the vertical coin. To do so, we 
derive the necessary optimaUty conditions for the optimal 
control problem Problem 111.1 in coordinates using equa- 
tions (27) for the dynamic constraints. The cost function in 
Problem (III.l), in coordinates, reads 



J = 



J2 



Writing equations (27) in first order form 



qi = Vi, 92 = V2, 93 = V3 
COS93U1 

Vi = -V2V3^ =:/i(q,v, u) 



V2 

V3 



V1V3 + 



m 

sin 93^1 _ 
m 

-J =■■ /3(q,v,u) 



=: /2(q,v,u) 



(36) 



the appended cost function is given by 



J = 



-T 



ul 



ul 



2 \rn? 

+Ji^ (91 - vi) + Jl^ (92 - V2) + /i^ (93 - W3) 

+t {Vl - h) + f iV2 - /2) + ff {V3 - h) dt, 



where, following traditional approaches to nonlinear opti- 
mal control theory, we take p, = fi^dqi + p^dq2 + jx^dq^ 
and j) = fj^dqi + ff'dq2 + ffdqz. 

Taking variations of the cost functional we get 



5J 



dui[ — - 7] ?7 

oui aui 



. . (U2 _3 9/3 \ 



X - 1 r-2 X f -3 : -1^/1 , -2^/2 

V 9q3 dq3 

+dvi (-^^ -rj^ - ffvz) 
+5v2 (-/S^ -ff + Tfvz) 

+5v3 (-M^ -rf + tv2 - fvi) . (37) 

Setting 5 J = in equation (37) and by virtue of the 
independence of the variations 5uj, 6qi, Svi, j = 1,2, j = 



1, 2, 3, we obtain the following necessary conditions 
m [rj^ cos qa + ff sin 




\ 2" 



Ml 



(38) 



= i sin(2g3) [{fj^Y - iff] - cos(2g3) 



7? = 



We need to show that these equations are indeed equivalent 
to equations (32), (33) and (34). To do that, let /x and f) be 
related to /lx and r] by 



cos 53, 
/z^ sin 93, 



^ cos 93 
sin 93 



(39) 



77 



One can easily check that substituting these relationships 
into equations (38) and after simple algebraic manipula- 
tions, we obtain equations (32), (33) and (34). It is inter- 
esting to note the particularly simple form of the necessary 
conditions obtained using the affine coimection approach 
when compared to the necessary conditions (38). At first 
glance, equations (38) may not appear to be solvable, 
whereas equations (32), (33) and (34) are clearly much 
easier to study. 

V. Conclusion 

In this paper we used the theory of affine connections 
to study an optimal control problem for a class of non- 
holonomic, under-actuated mechanical systems. The cost 
function is the norm-squared of the control input exerted in 
moving the system from an initial to a terminal state under 
the assumption of controllability. We gave a brief overview 
of some facts from Riemannian geometry and the use of the 
nonholonomic connection to deriving the constrained equa- 
tions of motion. We formulated an optimal control problem, 
where we used the nonholonomic affine connection together 
with Lagrange's multiplier method in the calculus of vari- 
ations to derive the optimal necessary conditions. We gave 
a simple example on a three-dimensional manifold with a 
single nonholonomic constraint that captures the main fea- 
tures of the theoretical result. Future work will focus on the 
treatment of nonholonomic systems with symmetry, which 
naturally occur in robotic locomotion [1]. In particular, we 
are interested in the structure of the resulting optimality 
conditions and the possibility of existence of closed form 
extremals. 
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